A Restless Bandit Model for Resource Allocation, Competition, and Reservation

J Fu; B Moran; PG Taylor

Journal article

A Restless Bandit Model for Resource Allocation, Competition, and Reservation

J Fu, B Moran, PG Taylor

Operations Research | INFORMS | Published : 2022

DOI: 10.1287/opre.2020.2066

Abstract

We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle's relaxation idea and Weber and Weiss' asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be check..

View full abstract

University of Melbourne Researchers

William Moran Author

Peter Taylor Author

Related Projects (1)

NEW STOCHASTIC MODELS FOR SCIENCE, ECONOMICS, SOCIAL SCIENCE AND ENGINEERING

Stochastic, or random, phenomena abound in society. This project will combine advancement of the theory of stochastic models at a deep leve..

Grants

Awarded by Australian Research Council

Funding Acknowledgements

J. Fu's and P. Taylor's research is supported by the Australian Research Council (ARC) Centre of Excellence for the Mathematical and Statistical Frontiers (ACEMS) and ARC Laureate Fellowship [Grant FL130100039].